
468 Bibliography
architectures for arbitrarily deep residual neu-
ral networks. AAAI Conference on Articial
Intelligence, 2811–2818. 323
Chang, Y.-L., Liu, Z. Y., Lee, K.-Y., & Hsu,
W. (2019b). Free-form video inpainting with
3D gated convolution and temporal Patch-
GAN. IEEE/CVF International Conference
on Computer Vision, 9066–9075. 181
Chaudhari, P., Choromanska, A., Soatto, S., Le-
Cun, Y., Baldassi, C., Borgs, C., Chayes, J.,
Sagun, L., & Zecchina, R. (2019). Entropy-
SGD: Biasing gradient descent into wide val-
leys. Journal of Statistical Mechanics: Theory
and Experiment, 12, 124018. 158, 411
Chen, D., Mei, J.-P., Zhang, Y., Wang, C., Wang,
Z., Feng, Y., & Chen, C. (2021a). Cross-layer
distillation with semantic calibration. AAAI
Conference on Articial Intelligence, 7028–
7036. 416
Chen, H., Wang, Y., Guo, T., Xu, C., Deng, Y.,
Liu, Z., Ma, S., Xu, C., Xu, C., & Gao, W.
(2021b). Pre-trained image processing trans-
former. IEEE/CVF Computer Vision & Pat-
tern Recognition, 12299–12310. 238
Chen, J., Ma, T., & Xiao, C. (2018a). FastGCN:
Fast learning with graph convolutional net-
works via importance sampling. International
Conference on Learning Representations. 264,
265
Chen, J., Zhu, J., & Song, L. (2018b). Stochastic
training of graph convolutional networks with
variance reduction. International Conference
on Machine Learning, 941–949. 264
Chen, L., Lu, K., Rajeswaran, A., Lee, K., Grover,
A., Laskin, M., Abbeel, P., Srinivas, A., &
Mordatch, I. (2021c). Decision transformer:
Reinforcement learning via sequence modeling.
Neural Information Processing Systems, 34,
15084–15097. 398
Chen, L.-C., Papandreou, G., Kokkinos, I., Mur-
phy, K., & Yuille, A. L. (2018c). DeepLab:
Semantic image segmentation with deep con-
volutional nets, atrous convolution, and fully
connected CRFs. IEEE Transactions on Pat-
tern Analysis & Machine Intelligence, 40(4),
834–—848. 181
Chen, M., Radford, A., Child, R., Wu, J., Jun, H.,
Luan, D., & Sutskever, I. (2020a). Generative
pretraining from pixels. International Confer-
ence on Machine Learning, 1691–1703. 238
Chen, M., Wei, Z., Huang, Z., Ding, B., & Li, Y.
(2020b). Simple and deep graph convolutional
networks. International Conference on Ma-
chine Learning, 1725–1735. 266
Chen, N., Zhang, Y., Zen, H., Weiss, R. J.,
Norouzi, M., Dehak, N., & Chan, W. (2021d).
WaveGrad 2: Iterative renement for text-
to-speech synthesis. INTERSPEECH, 3765–
3769. 369
Chen, R. T., Behrmann, J., Duvenaud, D. K., &
Jacobsen, J.-H. (2019). Residual ows for in-
vertible generative modeling. Neural Informa-
tion Processing Systems, 32, 9913–9923. 324
Chen, R. T., Li, X., Grosse, R. B., & Duvenaud,
D. K. (2018d). Isolating sources of disentangle-
ment in variational autoencoders. Neural In-
formation Processing Systems, 31, 2615–2625.
343, 346
Chen, R. T., Rubanova, Y., Bettencourt, J., & Du-
venaud, D. K. (2018e). Neural ordinary dier-
ential equations. Neural Information Process-
ing Systems, 31, 6572–6583. 324
Chen, T., Fox, E., & Guestrin, C. (2014). Stochas-
tic gradient Hamiltonian Monte Carlo. In-
ternational Conference on Machine Learning,
1683–1691. 159
Chen, T., Kornblith, S., Norouzi, M., & Hinton, G.
(2020c). A simple framework for contrastive
learning of visual representations. Interna-
tional Conference on Machine Learning, 1597–
1607. 159
Chen, T., Xu, B., Zhang, C., & Guestrin, C.
(2016a). Training deep nets with sublinear
memory cost. arXiv:1604.06174. 114
Chen, W., Liu, T.-Y., Lan, Y., Ma, Z.-M., & Li,
H. (2009). Ranking measures and loss func-
tions in learning to rank. Neural Information
Processing Systems, 22, 315–323. 73
Chen, X., Duan, Y., Houthooft, R., Schulman,
J., Sutskever, I., & Abbeel, P. (2016b). Info-
GAN: Interpretable representation learning by
information maximizing generative adversarial
nets. Neural Information Processing Systems,
29, 2172–2180. 291, 301
Chen, X., Kingma, D. P., Salimans, T., Duan, Y.,
Dhariwal, P., Schulman, J., Sutskever, I., &
Abbeel, P. (2017). Variational lossy autoen-
coder. International Conference on Learning
Representations. 345
Chen, Y.-C., Li, L., Yu, L., El Kholy, A., Ahmed,
F., Gan, Z., Cheng, Y., & Liu, J. (2020d).
UNITER: Universal image-text representation
learning. European Conference on Computer
Vision, 104–120. 238
Chiang, W.-L., Liu, X., Si, S., Li, Y., Bengio, S.,
& Hsieh, C.-J. (2019). Cluster-GCN: An ef-
cient algorithm for training deep and large
This work is subject to a Creative Commons CC-BY-NC-ND license. (C) MIT Press.